Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 18723 |
| Missing cells | 74944 |
| Missing cells (%) | 20.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 10.8 MiB |
| Average record size in memory | 607.3 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 7 |
| Unsupported | 4 |
survey_id has constant value "1476" | Constant |
city has constant value "Amsterdam" | Constant |
name has a high cardinality: 18150 distinct values | High cardinality |
last_modified has a high cardinality: 18723 distinct values | High cardinality |
location has a high cardinality: 18723 distinct values | High cardinality |
room_id is highly overall correlated with reviews | High correlation |
reviews is highly overall correlated with room_id and 1 other fields | High correlation |
overall_satisfaction is highly overall correlated with reviews | High correlation |
accommodates is highly overall correlated with bedrooms and 1 other fields | High correlation |
bedrooms is highly overall correlated with accommodates and 1 other fields | High correlation |
price is highly overall correlated with accommodates and 1 other fields | High correlation |
latitude is highly overall correlated with neighborhood | High correlation |
longitude is highly overall correlated with neighborhood | High correlation |
neighborhood is highly overall correlated with latitude and 1 other fields | High correlation |
room_type is highly imbalanced (52.9%) | Imbalance |
country has 18723 (100.0%) missing values | Missing |
borough has 18723 (100.0%) missing values | Missing |
bathrooms has 18723 (100.0%) missing values | Missing |
minstay has 18723 (100.0%) missing values | Missing |
name is uniformly distributed | Uniform |
last_modified is uniformly distributed | Uniform |
location is uniformly distributed | Uniform |
room_id has unique values | Unique |
last_modified has unique values | Unique |
location has unique values | Unique |
country is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
borough is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
bathrooms is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
minstay is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
reviews has 2984 (15.9%) zeros | Zeros |
overall_satisfaction has 5748 (30.7%) zeros | Zeros |
bedrooms has 1154 (6.2%) zeros | Zeros |
Reproduction
| Analysis started | 2023-06-18 11:26:53.024779 |
|---|---|
| Analysis finished | 2023-06-18 11:27:14.071455 |
| Duration | 21.05 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
room_id
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 18723 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11205678 |
| Minimum | 2818 |
|---|---|
| Maximum | 20003728 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 2818 |
|---|---|
| 5-th percentile | 1018799.4 |
| Q1 | 6050607.5 |
| median | 12282874 |
| Q3 | 16610843 |
| 95-th percentile | 19578785 |
| Maximum | 20003728 |
| Range | 20000910 |
| Interquartile range (IQR) | 10560236 |
Descriptive statistics
| Standard deviation | 6082192.3 |
|---|---|
| Coefficient of variation (CV) | 0.54277771 |
| Kurtosis | -1.2261316 |
| Mean | 11205678 |
| Median Absolute Deviation (MAD) | 5236951 |
| Skewness | -0.25429781 |
| Sum | 2.0980391 × 1011 |
| Variance | 3.6993063 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10176931 | 1 | < 0.1% |
| 7050078 | 1 | < 0.1% |
| 16448872 | 1 | < 0.1% |
| 11316029 | 1 | < 0.1% |
| 13561991 | 1 | < 0.1% |
| 10580396 | 1 | < 0.1% |
| 3957325 | 1 | < 0.1% |
| 8590437 | 1 | < 0.1% |
| 2180810 | 1 | < 0.1% |
| 14723562 | 1 | < 0.1% |
| Other values (18713) | 18713 |
| Value | Count | Frequency (%) |
| 2818 | 1 | |
| 3209 | 1 | |
| 20168 | 1 | |
| 25428 | 1 | |
| 25488 | 1 | |
| 27886 | 1 | |
| 28658 | 1 | |
| 28871 | 1 | |
| 29051 | 1 | |
| 29554 | 1 |
| Value | Count | Frequency (%) |
| 20003728 | 1 | |
| 19996091 | 1 | |
| 19995673 | 1 | |
| 19995327 | 1 | |
| 19995246 | 1 | |
| 19995106 | 1 | |
| 19994262 | 1 | |
| 19992677 | 1 | |
| 19992596 | 1 | |
| 19992241 | 1 |
survey_id
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 1476 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 74892 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1476 |
|---|---|
| 2nd row | 1476 |
| 3rd row | 1476 |
| 4th row | 1476 |
| 5th row | 1476 |
Common Values
| Value | Count | Frequency (%) |
| 1476 | 18723 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1476 | 18723 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 18723 | |
| 4 | 18723 | |
| 7 | 18723 | |
| 6 | 18723 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 74892 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 18723 | |
| 4 | 18723 | |
| 7 | 18723 | |
| 6 | 18723 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 74892 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 18723 | |
| 4 | 18723 | |
| 7 | 18723 | |
| 6 | 18723 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 74892 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 18723 | |
| 4 | 18723 | |
| 7 | 18723 | |
| 6 | 18723 |
host_id
Real number (ℝ)
| Distinct | 15943 |
|---|---|
| Distinct (%) | 85.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35776117 |
| Minimum | 2234 |
|---|---|
| Maximum | 1.4183192 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 2234 |
|---|---|
| 5-th percentile | 1477396.3 |
| Q1 | 7140879 |
| median | 19886414 |
| Q3 | 52026801 |
| 95-th percentile | 1.2189162 × 108 |
| Maximum | 1.4183192 × 108 |
| Range | 1.4182968 × 108 |
| Interquartile range (IQR) | 44885922 |
Descriptive statistics
| Standard deviation | 37581026 |
|---|---|
| Coefficient of variation (CV) | 1.0504501 |
| Kurtosis | 0.49010902 |
| Mean | 35776117 |
| Median Absolute Deviation (MAD) | 15783470 |
| Skewness | 1.2438812 |
| Sum | 6.6983623 × 1011 |
| Variance | 1.4123335 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48703385 | 93 | 0.5% |
| 113977564 | 88 | 0.5% |
| 1464510 | 71 | 0.4% |
| 107745142 | 64 | 0.3% |
| 84453740 | 61 | 0.3% |
| 65859990 | 54 | 0.3% |
| 517215 | 52 | 0.3% |
| 46691672 | 43 | 0.2% |
| 84449589 | 37 | 0.2% |
| 669178 | 36 | 0.2% |
| Other values (15933) | 18124 |
| Value | Count | Frequency (%) |
| 2234 | 1 | |
| 3159 | 1 | |
| 3806 | 1 | |
| 5988 | 2 | |
| 7924 | 1 | |
| 12085 | 1 | |
| 20405 | 1 | |
| 34080 | 1 | |
| 36701 | 1 | |
| 40786 | 1 |
| Value | Count | Frequency (%) |
| 141831915 | 1 | < 0.1% |
| 141749109 | 1 | < 0.1% |
| 141747815 | 1 | < 0.1% |
| 141665148 | 4 | |
| 141658022 | 1 | < 0.1% |
| 141648682 | 1 | < 0.1% |
| 141551211 | 1 | < 0.1% |
| 141548705 | 1 | < 0.1% |
| 141542351 | 1 | < 0.1% |
| 141534602 | 1 | < 0.1% |
room_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.3 MiB |
| Entire home/apt | |
|---|---|
| Private room | |
| Shared room | 63 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.396571 |
| Min length | 11 |
Characters and Unicode
| Total characters | 269547 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Shared room |
|---|---|
| 2nd row | Shared room |
| 3rd row | Shared room |
| 4th row | Shared room |
| 5th row | Shared room |
Common Values
| Value | Count | Frequency (%) |
| Entire home/apt | 14978 | |
| Private room | 3682 | 19.7% |
| Shared room | 63 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| entire | 14978 | |
| home/apt | 14978 | |
| room | 3745 | 10.0% |
| private | 3682 | 9.8% |
| shared | 63 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 33701 | |
| t | 33638 | |
| o | 22468 | |
| r | 22468 | |
| a | 18723 | 6.9% |
| 18723 | 6.9% | |
| m | 18723 | 6.9% |
| i | 18660 | 6.9% |
| h | 15041 | 5.6% |
| p | 14978 | 5.6% |
| Other values (7) | 52424 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 217123 | |
| Space Separator | 18723 | 6.9% |
| Uppercase Letter | 18723 | 6.9% |
| Other Punctuation | 14978 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 33701 | |
| t | 33638 | |
| o | 22468 | |
| r | 22468 | |
| a | 18723 | |
| m | 18723 | |
| i | 18660 | |
| h | 15041 | |
| p | 14978 | |
| n | 14978 | |
| Other values (2) | 3745 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 14978 | |
| P | 3682 | 19.7% |
| S | 63 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 18723 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 14978 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 235846 | |
| Common | 33701 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 33701 | |
| t | 33638 | |
| o | 22468 | |
| r | 22468 | |
| a | 18723 | |
| m | 18723 | |
| i | 18660 | |
| h | 15041 | |
| p | 14978 | |
| E | 14978 | |
| Other values (5) | 22468 |
Common
| Value | Count | Frequency (%) |
| 18723 | ||
| / | 14978 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 269547 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 33701 | |
| t | 33638 | |
| o | 22468 | |
| r | 22468 | |
| a | 18723 | 6.9% |
| 18723 | 6.9% | |
| m | 18723 | 6.9% |
| i | 18660 | 6.9% |
| h | 15041 | 5.6% |
| p | 14978 | 5.6% |
| Other values (7) | 52424 |
country
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 18723 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 146.4 KiB |
city
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| Amsterdam |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 168507 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Amsterdam |
|---|---|
| 2nd row | Amsterdam |
| 3rd row | Amsterdam |
| 4th row | Amsterdam |
| 5th row | Amsterdam |
Common Values
| Value | Count | Frequency (%) |
| Amsterdam | 18723 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| amsterdam | 18723 |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 37446 | |
| A | 18723 | |
| s | 18723 | |
| t | 18723 | |
| e | 18723 | |
| r | 18723 | |
| d | 18723 | |
| a | 18723 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 149784 | |
| Uppercase Letter | 18723 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 37446 | |
| s | 18723 | |
| t | 18723 | |
| e | 18723 | |
| r | 18723 | |
| d | 18723 | |
| a | 18723 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 18723 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 168507 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 37446 | |
| A | 18723 | |
| s | 18723 | |
| t | 18723 | |
| e | 18723 | |
| r | 18723 | |
| d | 18723 | |
| a | 18723 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 168507 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| m | 37446 | |
| A | 18723 | |
| s | 18723 | |
| t | 18723 | |
| e | 18723 | |
| r | 18723 | |
| d | 18723 | |
| a | 18723 |
borough
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 18723 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 146.4 KiB |
neighborhood
Categorical
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.3 MiB |
| De Baarsjes / Oud West | |
|---|---|
| De Pijp / Rivierenbuurt | |
| Centrum West | |
| Centrum Oost | |
| Westerpark | |
| Other values (18) |
Length
| Max length | 38 |
|---|---|
| Median length | 23 |
| Mean length | 17.515729 |
| Min length | 6 |
Characters and Unicode
| Total characters | 327947 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | De Pijp / Rivierenbuurt |
|---|---|
| 2nd row | Centrum West |
| 3rd row | Watergraafsmeer |
| 4th row | Centrum West |
| 5th row | De Baarsjes / Oud West |
Common Values
| Value | Count | Frequency (%) |
| De Baarsjes / Oud West | 3289 | |
| De Pijp / Rivierenbuurt | 2378 | |
| Centrum West | 2225 | |
| Centrum Oost | 1730 | |
| Westerpark | 1430 | |
| Noord-West / Noord-Midden | 1418 | |
| Oud Oost | 1169 | 6.2% |
| Bos en Lommer | 988 | 5.3% |
| Oostelijk Havengebied / Indische Buurt | 921 | 4.9% |
| Watergraafsmeer | 517 | 2.8% |
| Other values (13) | 2658 |
Length
| Value | Count | Frequency (%) |
| 8985 | ||
| de | 5781 | |
| west | 5755 | |
| oud | 4952 | 8.8% |
| centrum | 4054 | 7.2% |
| baarsjes | 3289 | 5.8% |
| oost | 3217 | 5.7% |
| rivierenbuurt | 2378 | 4.2% |
| pijp | 2378 | 4.2% |
| westerpark | 1430 | 2.5% |
| Other values (27) | 14130 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 41125 | 12.5% |
| 37626 | 11.5% | |
| r | 24877 | 7.6% |
| s | 22215 | 6.8% |
| t | 22148 | 6.8% |
| u | 17169 | 5.2% |
| d | 14742 | 4.5% |
| o | 14591 | 4.4% |
| i | 12545 | 3.8% |
| a | 11932 | 3.6% |
| Other values (33) | 108977 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 229288 | |
| Uppercase Letter | 49212 | 15.0% |
| Space Separator | 37626 | 11.5% |
| Other Punctuation | 8985 | 2.7% |
| Dash Punctuation | 2836 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 41125 | |
| r | 24877 | |
| s | 22215 | |
| t | 22148 | |
| u | 17169 | |
| d | 14742 | 6.4% |
| o | 14591 | 6.4% |
| i | 12545 | 5.5% |
| a | 11932 | 5.2% |
| n | 11659 | 5.1% |
| Other values (13) | 36285 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 9253 | |
| W | 9135 | |
| D | 5823 | |
| B | 5644 | |
| C | 4054 | |
| N | 3906 | |
| R | 2378 | 4.8% |
| P | 2378 | 4.8% |
| M | 1418 | 2.9% |
| I | 1299 | 2.6% |
| Other values (7) | 3924 |
Space Separator
| Value | Count | Frequency (%) |
| 37626 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 8985 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2836 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 278500 | |
| Common | 49447 | 15.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 41125 | |
| r | 24877 | 8.9% |
| s | 22215 | 8.0% |
| t | 22148 | 8.0% |
| u | 17169 | 6.2% |
| d | 14742 | 5.3% |
| o | 14591 | 5.2% |
| i | 12545 | 4.5% |
| a | 11932 | 4.3% |
| n | 11659 | 4.2% |
| Other values (30) | 85497 |
Common
| Value | Count | Frequency (%) |
| 37626 | ||
| / | 8985 | 18.2% |
| - | 2836 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 327947 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 41125 | 12.5% |
| 37626 | 11.5% | |
| r | 24877 | 7.6% |
| s | 22215 | 6.8% |
| t | 22148 | 6.8% |
| u | 17169 | 5.2% |
| d | 14742 | 4.5% |
| o | 14591 | 4.4% |
| i | 12545 | 3.8% |
| a | 11932 | 3.6% |
| Other values (33) | 108977 |
reviews
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 284 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.741548 |
| Minimum | 0 |
|---|---|
| Maximum | 532 |
| Zeros | 2984 |
| Zeros (%) | 15.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 6 |
| Q3 | 17 |
| 95-th percentile | 67 |
| Maximum | 532 |
| Range | 532 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 33.52263 |
|---|---|
| Coefficient of variation (CV) | 2.0023614 |
| Kurtosis | 43.756435 |
| Mean | 16.741548 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 5.5027866 |
| Sum | 313452 |
| Variance | 1123.7667 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2984 | 15.9% |
| 1 | 1510 | 8.1% |
| 2 | 1246 | 6.7% |
| 3 | 1103 | 5.9% |
| 4 | 925 | 4.9% |
| 5 | 876 | 4.7% |
| 6 | 741 | 4.0% |
| 7 | 683 | 3.6% |
| 8 | 590 | 3.2% |
| 9 | 529 | 2.8% |
| Other values (274) | 7536 |
| Value | Count | Frequency (%) |
| 0 | 2984 | |
| 1 | 1510 | |
| 2 | 1246 | |
| 3 | 1103 | 5.9% |
| 4 | 925 | 4.9% |
| 5 | 876 | 4.7% |
| 6 | 741 | 4.0% |
| 7 | 683 | 3.6% |
| 8 | 590 | 3.2% |
| 9 | 529 | 2.8% |
| Value | Count | Frequency (%) |
| 532 | 1 | |
| 465 | 1 | |
| 463 | 1 | |
| 452 | 1 | |
| 447 | 1 | |
| 443 | 2 | |
| 433 | 1 | |
| 430 | 2 | |
| 425 | 1 | |
| 410 | 2 |
overall_satisfaction
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.301127 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 5748 |
| Zeros (%) | 30.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4.5 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.2135575 |
|---|---|
| Coefficient of variation (CV) | 0.67054601 |
| Kurtosis | -1.3171244 |
| Mean | 3.301127 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.79270161 |
| Sum | 61807 |
| Variance | 4.8998369 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 7708 | |
| 0 | 5748 | |
| 4.5 | 4559 | |
| 4 | 577 | 3.1% |
| 3.5 | 109 | 0.6% |
| 3 | 19 | 0.1% |
| 1.5 | 1 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 5748 | |
| 1 | 1 | < 0.1% |
| 1.5 | 1 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 3 | 19 | 0.1% |
| 3.5 | 109 | 0.6% |
| 4 | 577 | 3.1% |
| 4.5 | 4559 | |
| 5 | 7708 |
| Value | Count | Frequency (%) |
| 5 | 7708 | |
| 4.5 | 4559 | |
| 4 | 577 | 3.1% |
| 3.5 | 109 | 0.6% |
| 3 | 19 | 0.1% |
| 2.5 | 1 | < 0.1% |
| 1.5 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 0 | 5748 |
accommodates
Real number (ℝ)
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.922021 |
| Minimum | 1 |
|---|---|
| Maximum | 17 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 17 |
| Range | 16 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.3275239 |
|---|---|
| Coefficient of variation (CV) | 0.45431703 |
| Kurtosis | 14.340675 |
| Mean | 2.922021 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.3887979 |
| Sum | 54709 |
| Variance | 1.7623198 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 10024 | |
| 4 | 5579 | |
| 3 | 1585 | 8.5% |
| 6 | 476 | 2.5% |
| 5 | 471 | 2.5% |
| 1 | 367 | 2.0% |
| 8 | 105 | 0.6% |
| 7 | 52 | 0.3% |
| 16 | 20 | 0.1% |
| 10 | 16 | 0.1% |
| Other values (6) | 28 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 367 | 2.0% |
| 2 | 10024 | |
| 3 | 1585 | 8.5% |
| 4 | 5579 | |
| 5 | 471 | 2.5% |
| 6 | 476 | 2.5% |
| 7 | 52 | 0.3% |
| 8 | 105 | 0.6% |
| 9 | 8 | < 0.1% |
| 10 | 16 | 0.1% |
| Value | Count | Frequency (%) |
| 17 | 1 | < 0.1% |
| 16 | 20 | 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 10 | 0.1% |
| 11 | 2 | < 0.1% |
| 10 | 16 | 0.1% |
| 9 | 8 | < 0.1% |
| 8 | 105 | |
| 7 | 52 |
bedrooms
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4303797 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 1154 |
| Zeros (%) | 6.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.87901869 |
|---|---|
| Coefficient of variation (CV) | 0.61453519 |
| Kurtosis | 5.6257566 |
| Mean | 1.4303797 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.6013041 |
| Sum | 26781 |
| Variance | 0.77267386 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 11101 | |
| 2 | 4456 | |
| 3 | 1444 | 7.7% |
| 0 | 1154 | 6.2% |
| 4 | 473 | 2.5% |
| 5 | 62 | 0.3% |
| 6 | 19 | 0.1% |
| 10 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1154 | 6.2% |
| 1 | 11101 | |
| 2 | 4456 | |
| 3 | 1444 | 7.7% |
| 4 | 473 | 2.5% |
| 5 | 62 | 0.3% |
| 6 | 19 | 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 5 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 3 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 19 | 0.1% |
| 5 | 62 | 0.3% |
| 4 | 473 | 2.5% |
| 3 | 1444 | 7.7% |
| 2 | 4456 | |
| 1 | 11101 |
bathrooms
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 18723 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 146.4 KiB |
price
Real number (ℝ)
| Distinct | 423 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 166.59948 |
| Minimum | 12 |
|---|---|
| Maximum | 6000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 72 |
| Q1 | 108 |
| median | 144 |
| Q3 | 192 |
| 95-th percentile | 330 |
| Maximum | 6000 |
| Range | 5988 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 108.94385 |
|---|---|
| Coefficient of variation (CV) | 0.65392672 |
| Kurtosis | 521.86526 |
| Mean | 166.59948 |
| Median Absolute Deviation (MAD) | 36 |
| Skewness | 12.768987 |
| Sum | 3119242 |
| Variance | 11868.762 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 119 | 1023 | 5.5% |
| 180 | 1001 | 5.3% |
| 144 | 887 | 4.7% |
| 150 | 621 | 3.3% |
| 132 | 588 | 3.1% |
| 108 | 562 | 3.0% |
| 96 | 520 | 2.8% |
| 114 | 509 | 2.7% |
| 118 | 508 | 2.7% |
| 240 | 495 | 2.6% |
| Other values (413) | 12009 |
| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 6 | |
| 25 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 29 | 2 | < 0.1% |
| 30 | 6 |
| Value | Count | Frequency (%) |
| 6000 | 1 | |
| 3770 | 1 | |
| 1920 | 1 | |
| 1799 | 1 | |
| 1558 | 1 | |
| 1428 | 1 | |
| 1412 | 1 | |
| 1386 | 1 | |
| 1343 | 1 | |
| 1319 | 1 |
minstay
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 18723 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 146.4 KiB |
name
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 18150 |
|---|---|
| Distinct (%) | 97.2% |
| Missing | 52 |
| Missing (%) | 0.3% |
| Memory size | 1.7 MiB |
| Amsterdam | 36 |
|---|---|
| Lovely apartment near Vondelpark | 10 |
| Beautiful apartment in Amsterdam | 8 |
| Cosy apartment in Amsterdam | 8 |
| Spacious family house with garden | 8 |
| Other values (18145) |
Length
| Max length | 78 |
|---|---|
| Median length | 50 |
| Mean length | 36.092336 |
| Min length | 1 |
Characters and Unicode
| Total characters | 673880 |
|---|---|
| Distinct characters | 157 |
| Distinct categories | 20 ? |
| Distinct scripts | 4 ? |
| Distinct blocks | 9 ? |
Unique
| Unique | 17814 ? |
|---|---|
| Unique (%) | 95.4% |
Sample
| 1st row | Red Light/ Canal view apartment (Shared) |
|---|---|
| 2nd row | Sunny and Cozy Living room in quite neighbours |
| 3rd row | Amsterdam |
| 4th row | Canal boat RIDE in Amsterdam |
| 5th row | One room for rent in a three room appartment |
Common Values
| Value | Count | Frequency (%) |
| Amsterdam | 36 | 0.2% |
| Lovely apartment near Vondelpark | 10 | 0.1% |
| Beautiful apartment in Amsterdam | 8 | < 0.1% |
| Cosy apartment in Amsterdam | 8 | < 0.1% |
| Spacious family house with garden | 8 | < 0.1% |
| Magnificent panoramic city view | 8 | < 0.1% |
| Nice comfy room, magnificent view | 7 | < 0.1% |
| Lovely apartment in Amsterdam | 7 | < 0.1% |
| Spacious apartment near Vondelpark | 7 | < 0.1% |
| Cosy apartment near Vondelpark | 6 | < 0.1% |
| Other values (18140) | 18566 | |
| (Missing) | 52 | 0.3% |
Length
| Value | Count | Frequency (%) |
| apartment | 7118 | 6.7% |
| in | 5730 | 5.4% |
| amsterdam | 3588 | 3.4% |
| 3195 | 3.0% | |
| with | 2669 | 2.5% |
| the | 2165 | 2.0% |
| spacious | 2082 | 2.0% |
| city | 1906 | 1.8% |
| centre | 1768 | 1.7% |
| room | 1728 | 1.6% |
| Other values (4867) | 73723 |
Most occurring characters
| Value | Count | Frequency (%) |
| 87491 | 13.0% | |
| e | 59230 | 8.8% |
| t | 55217 | 8.2% |
| a | 52626 | 7.8% |
| r | 42831 | 6.4% |
| n | 39759 | 5.9% |
| o | 35472 | 5.3% |
| i | 32482 | 4.8% |
| m | 26379 | 3.9% |
| s | 21195 | 3.1% |
| Other values (147) | 221198 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 510398 | |
| Space Separator | 87492 | 13.0% |
| Uppercase Letter | 54936 | 8.2% |
| Other Punctuation | 11184 | 1.7% |
| Decimal Number | 5572 | 0.8% |
| Dash Punctuation | 1595 | 0.2% |
| Math Symbol | 1136 | 0.2% |
| Close Punctuation | 621 | 0.1% |
| Open Punctuation | 588 | 0.1% |
| Other Symbol | 236 | < 0.1% |
| Other values (10) | 122 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 姆 | 2 | 5.1% |
| 公 | 2 | 5.1% |
| 丹 | 2 | 5.1% |
| 特 | 2 | 5.1% |
| 斯 | 2 | 5.1% |
| 阿 | 2 | 5.1% |
| 到 | 2 | 5.1% |
| 站 | 1 | 2.6% |
| 十 | 1 | 2.6% |
| 五 | 1 | 2.6% |
| Other values (22) | 22 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 59230 | |
| t | 55217 | |
| a | 52626 | |
| r | 42831 | 8.4% |
| n | 39759 | 7.8% |
| o | 35472 | 6.9% |
| i | 32482 | 6.4% |
| m | 26379 | 5.2% |
| s | 21195 | 4.2% |
| p | 19825 | 3.9% |
| Other values (20) | 125382 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 8892 | |
| C | 6863 | |
| S | 4399 | 8.0% |
| L | 3283 | 6.0% |
| B | 3251 | 5.9% |
| R | 2791 | 5.1% |
| P | 2694 | 4.9% |
| E | 2341 | 4.3% |
| T | 2219 | 4.0% |
| N | 2194 | 4.0% |
| Other values (17) | 16009 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2817 | |
| ! | 2756 | |
| & | 1686 | |
| . | 1473 | |
| ' | 831 | 7.4% |
| / | 587 | 5.2% |
| @ | 315 | 2.8% |
| " | 285 | 2.5% |
| : | 189 | 1.7% |
| * | 154 | 1.4% |
| Other values (7) | 91 | 0.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1885 | |
| 1 | 992 | |
| 0 | 741 | 13.3% |
| 5 | 498 | 8.9% |
| 3 | 463 | 8.3% |
| 4 | 412 | 7.4% |
| 8 | 150 | 2.7% |
| 6 | 150 | 2.7% |
| 9 | 145 | 2.6% |
| 7 | 136 | 2.4% |
Other Symbol
| Value | Count | Frequency (%) |
| ★ | 171 | |
| ☆ | 33 | 14.0% |
| ❤ | 14 | 5.9% |
| ♥ | 5 | 2.1% |
| ♡ | 5 | 2.1% |
| ° | 3 | 1.3% |
| ⭐ | 3 | 1.3% |
| ☕ | 1 | 0.4% |
| ☺ | 1 | 0.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 660 | |
| | | 460 | |
| < | 5 | 0.4% |
| > | 4 | 0.4% |
| = | 3 | 0.3% |
| ~ | 2 | 0.2% |
| ⊕ | 1 | 0.1% |
| ÷ | 1 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 614 | |
| ] | 6 | 1.0% |
| 】 | 1 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 581 | |
| [ | 6 | 1.0% |
| 【 | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 87491 | ||
| 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1593 | |
| – | 2 | 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ️ | 15 | |
| ︎ | 1 | 6.2% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 9 | |
| ” | 2 | 18.2% |
Control
| Value | Count | Frequency (%) |
| 6 | ||
| 6 |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 4 | |
| $ | 1 | 20.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 3 | |
| “ | 2 |
Other Number
| Value | Count | Frequency (%) |
| ² | 22 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 4 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 565334 | |
| Common | 108491 | 16.1% |
| Han | 39 | < 0.1% |
| Inherited | 16 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 87491 | ||
| , | 2817 | 2.6% |
| ! | 2756 | 2.5% |
| 2 | 1885 | 1.7% |
| & | 1686 | 1.6% |
| - | 1593 | 1.5% |
| . | 1473 | 1.4% |
| 1 | 992 | 0.9% |
| ' | 831 | 0.8% |
| 0 | 741 | 0.7% |
| Other values (56) | 6226 | 5.7% |
Latin
| Value | Count | Frequency (%) |
| e | 59230 | 10.5% |
| t | 55217 | 9.8% |
| a | 52626 | 9.3% |
| r | 42831 | 7.6% |
| n | 39759 | 7.0% |
| o | 35472 | 6.3% |
| i | 32482 | 5.7% |
| m | 26379 | 4.7% |
| s | 21195 | 3.7% |
| p | 19825 | 3.5% |
| Other values (47) | 180318 |
Han
| Value | Count | Frequency (%) |
| 姆 | 2 | 5.1% |
| 公 | 2 | 5.1% |
| 丹 | 2 | 5.1% |
| 特 | 2 | 5.1% |
| 斯 | 2 | 5.1% |
| 阿 | 2 | 5.1% |
| 到 | 2 | 5.1% |
| 站 | 1 | 2.6% |
| 十 | 1 | 2.6% |
| 五 | 1 | 2.6% |
| Other values (22) | 22 |
Inherited
| Value | Count | Frequency (%) |
| ️ | 15 | |
| ︎ | 1 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 673492 | |
| Misc Symbols | 216 | < 0.1% |
| None | 64 | < 0.1% |
| CJK | 39 | < 0.1% |
| Punctuation | 34 | < 0.1% |
| VS | 16 | < 0.1% |
| Dingbats | 14 | < 0.1% |
| Currency Symbols | 4 | < 0.1% |
| Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 87491 | 13.0% | |
| e | 59230 | 8.8% |
| t | 55217 | 8.2% |
| a | 52626 | 7.8% |
| r | 42831 | 6.4% |
| n | 39759 | 5.9% |
| o | 35472 | 5.3% |
| i | 32482 | 4.8% |
| m | 26379 | 3.9% |
| s | 21195 | 3.1% |
| Other values (82) | 220810 |
Misc Symbols
| Value | Count | Frequency (%) |
| ★ | 171 | |
| ☆ | 33 | 15.3% |
| ♥ | 5 | 2.3% |
| ♡ | 5 | 2.3% |
| ☕ | 1 | 0.5% |
| ☺ | 1 | 0.5% |
None
| Value | Count | Frequency (%) |
| ² | 22 | |
| é | 15 | |
| ´ | 4 | 6.2% |
| à | 4 | 6.2% |
| ° | 3 | 4.7% |
| É | 3 | 4.7% |
| ⭐ | 3 | 4.7% |
| , | 2 | 3.1% |
| á | 2 | 3.1% |
| 1 | 1.6% | |
| Other values (5) | 5 | 7.8% |
VS
| Value | Count | Frequency (%) |
| ️ | 15 | |
| ︎ | 1 | 6.2% |
Punctuation
| Value | Count | Frequency (%) |
| • | 15 | |
| ’ | 9 | |
| ‘ | 3 | 8.8% |
| ” | 2 | 5.9% |
| – | 2 | 5.9% |
| “ | 2 | 5.9% |
| | 1 | 2.9% |
Dingbats
| Value | Count | Frequency (%) |
| ❤ | 14 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 4 |
CJK
| Value | Count | Frequency (%) |
| 姆 | 2 | 5.1% |
| 公 | 2 | 5.1% |
| 丹 | 2 | 5.1% |
| 特 | 2 | 5.1% |
| 斯 | 2 | 5.1% |
| 阿 | 2 | 5.1% |
| 到 | 2 | 5.1% |
| 站 | 1 | 2.6% |
| 十 | 1 | 2.6% |
| 五 | 1 | 2.6% |
| Other values (22) | 22 |
Math Operators
| Value | Count | Frequency (%) |
| ⊕ | 1 |
last_modified
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 18723 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 2017-07-23 13:06:27.391699 | 1 |
|---|---|
| 2017-07-22 17:50:54.562470 | 1 |
| 2017-07-22 17:50:37.799843 | 1 |
| 2017-07-22 17:50:37.804073 | 1 |
| 2017-07-22 17:50:37.808374 | 1 |
| Other values (18718) |
Length
| Max length | 26 |
|---|---|
| Median length | 26 |
| Mean length | 26 |
| Min length | 26 |
Characters and Unicode
| Total characters | 486798 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18723 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2017-07-23 13:06:27.391699 |
|---|---|
| 2nd row | 2017-07-23 13:06:23.607187 |
| 3rd row | 2017-07-23 13:06:23.603546 |
| 4th row | 2017-07-23 13:06:22.689787 |
| 5th row | 2017-07-23 13:06:19.681469 |
Common Values
| Value | Count | Frequency (%) |
| 2017-07-23 13:06:27.391699 | 1 | < 0.1% |
| 2017-07-22 17:50:54.562470 | 1 | < 0.1% |
| 2017-07-22 17:50:37.799843 | 1 | < 0.1% |
| 2017-07-22 17:50:37.804073 | 1 | < 0.1% |
| 2017-07-22 17:50:37.808374 | 1 | < 0.1% |
| 2017-07-22 17:50:37.811917 | 1 | < 0.1% |
| 2017-07-22 17:50:41.366311 | 1 | < 0.1% |
| 2017-07-22 17:50:41.404879 | 1 | < 0.1% |
| 2017-07-22 17:50:44.108265 | 1 | < 0.1% |
| 2017-07-22 17:50:45.777034 | 1 | < 0.1% |
| Other values (18713) | 18713 |
Length
| Value | Count | Frequency (%) |
| 2017-07-22 | 13694 | |
| 2017-07-23 | 5029 | 13.4% |
| 13:06:07.443359 | 1 | < 0.1% |
| 13:06:22.689787 | 1 | < 0.1% |
| 13:06:19.681469 | 1 | < 0.1% |
| 13:06:19.663975 | 1 | < 0.1% |
| 13:06:09.988016 | 1 | < 0.1% |
| 13:06:09.984748 | 1 | < 0.1% |
| 13:05:45.744708 | 1 | < 0.1% |
| 13:06:07.452609 | 1 | < 0.1% |
| Other values (18715) | 18715 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 81800 | |
| 0 | 65421 | |
| 7 | 55572 | |
| 1 | 48766 | |
| - | 37446 | |
| : | 37446 | |
| 3 | 29626 | 6.1% |
| 5 | 22515 | 4.6% |
| 4 | 19641 | 4.0% |
| 6 | 19288 | 4.0% |
| Other values (4) | 69277 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 374460 | |
| Other Punctuation | 56169 | 11.5% |
| Dash Punctuation | 37446 | 7.7% |
| Space Separator | 18723 | 3.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 81800 | |
| 0 | 65421 | |
| 7 | 55572 | |
| 1 | 48766 | |
| 3 | 29626 | 7.9% |
| 5 | 22515 | 6.0% |
| 4 | 19641 | 5.2% |
| 6 | 19288 | 5.2% |
| 8 | 16269 | 4.3% |
| 9 | 15562 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 37446 | |
| . | 18723 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37446 |
Space Separator
| Value | Count | Frequency (%) |
| 18723 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 486798 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 81800 | |
| 0 | 65421 | |
| 7 | 55572 | |
| 1 | 48766 | |
| - | 37446 | |
| : | 37446 | |
| 3 | 29626 | 6.1% |
| 5 | 22515 | 4.6% |
| 4 | 19641 | 4.0% |
| 6 | 19288 | 4.0% |
| Other values (4) | 69277 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 486798 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 81800 | |
| 0 | 65421 | |
| 7 | 55572 | |
| 1 | 48766 | |
| - | 37446 | |
| : | 37446 | |
| 3 | 29626 | 6.1% |
| 5 | 22515 | 4.6% |
| 4 | 19641 | 4.0% |
| 6 | 19288 | 4.0% |
| Other values (4) | 69277 |
latitude
Real number (ℝ)
| Distinct | 15595 |
|---|---|
| Distinct (%) | 83.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.365261 |
| Minimum | 52.2962 |
|---|---|
| Maximum | 52.42498 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 52.2962 |
|---|---|
| 5-th percentile | 52.343288 |
| Q1 | 52.355254 |
| median | 52.364628 |
| Q3 | 52.374797 |
| 95-th percentile | 52.389372 |
| Maximum | 52.42498 |
| Range | 0.12878 |
| Interquartile range (IQR) | 0.019544 |
Descriptive statistics
| Standard deviation | 0.015142042 |
|---|---|
| Coefficient of variation (CV) | 0.00028916198 |
| Kurtosis | 1.4182599 |
| Mean | 52.365261 |
| Median Absolute Deviation (MAD) | 0.009735 |
| Skewness | 0.0079054175 |
| Sum | 980434.77 |
| Variance | 0.00022928145 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 52.354646 | 5 | < 0.1% |
| 52.360546 | 5 | < 0.1% |
| 52.361364 | 5 | < 0.1% |
| 52.366852 | 5 | < 0.1% |
| 52.355191 | 4 | < 0.1% |
| 52.354748 | 4 | < 0.1% |
| 52.36104 | 4 | < 0.1% |
| 52.361689 | 4 | < 0.1% |
| 52.362592 | 4 | < 0.1% |
| 52.361118 | 4 | < 0.1% |
| Other values (15585) | 18679 |
| Value | Count | Frequency (%) |
| 52.2962 | 1 | |
| 52.297203 | 1 | |
| 52.299763 | 1 | |
| 52.299846 | 1 | |
| 52.299875 | 1 | |
| 52.300105 | 1 | |
| 52.30013 | 1 | |
| 52.300915 | 1 | |
| 52.301257 | 1 | |
| 52.301683 | 1 |
| Value | Count | Frequency (%) |
| 52.42498 | 1 | |
| 52.424641 | 1 | |
| 52.424255 | 1 | |
| 52.423647 | 1 | |
| 52.423498 | 1 | |
| 52.423432 | 1 | |
| 52.423321 | 1 | |
| 52.422827 | 1 | |
| 52.422232 | 1 | |
| 52.422228 | 1 |
longitude
Real number (ℝ)
| Distinct | 17157 |
|---|---|
| Distinct (%) | 91.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.8885852 |
| Minimum | 4.763264 |
|---|---|
| Maximum | 5.027689 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 146.4 KiB |
Quantile statistics
| Minimum | 4.763264 |
|---|---|
| 5-th percentile | 4.8453101 |
| Q1 | 4.8643445 |
| median | 4.885994 |
| Q3 | 4.90748 |
| 95-th percentile | 4.9445407 |
| Maximum | 5.027689 |
| Range | 0.264425 |
| Interquartile range (IQR) | 0.0431355 |
Descriptive statistics
| Standard deviation | 0.034536882 |
|---|---|
| Coefficient of variation (CV) | 0.0070648011 |
| Kurtosis | 1.2170001 |
| Mean | 4.8885852 |
| Median Absolute Deviation (MAD) | 0.021585 |
| Skewness | 0.53824711 |
| Sum | 91528.98 |
| Variance | 0.0011927962 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.907187 | 5 | < 0.1% |
| 4.888738 | 4 | < 0.1% |
| 4.904646 | 4 | < 0.1% |
| 4.856525 | 4 | < 0.1% |
| 4.893506 | 4 | < 0.1% |
| 4.875611 | 4 | < 0.1% |
| 4.893017 | 4 | < 0.1% |
| 4.86301 | 4 | < 0.1% |
| 4.877004 | 4 | < 0.1% |
| 4.861512 | 4 | < 0.1% |
| Other values (17147) | 18682 |
| Value | Count | Frequency (%) |
| 4.763264 | 1 | |
| 4.768452 | 1 | |
| 4.769151 | 1 | |
| 4.771083 | 1 | |
| 4.772725 | 1 | |
| 4.772822 | 1 | |
| 4.775168 | 1 | |
| 4.775748 | 1 | |
| 4.77647 | 1 | |
| 4.77764 | 1 |
| Value | Count | Frequency (%) |
| 5.027689 | 1 | |
| 5.026701 | 1 | |
| 5.015737 | 1 | |
| 5.013557 | 1 | |
| 5.013316 | 1 | |
| 5.013075 | 1 | |
| 5.012549 | 1 | |
| 5.011693 | 1 | |
| 5.011688 | 1 | |
| 5.011569 | 1 |
location
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 18723 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.9 MiB |
| 0101000020E610000033FAD170CA8C13403BC5AA41982D4A40 | 1 |
|---|---|
| 0101000020E6100000C30DF8FC30821340342BDB87BC314A40 | 1 |
| 0101000020E61000007B849A2155B41340D6C9198A3B2E4A40 | 1 |
| 0101000020E610000056D3F544D7851340C79E3D97A9314A40 | 1 |
| 0101000020E61000002CB7B41A12A71340C5E40D30F32D4A40 | 1 |
| Other values (18718) |
Length
| Max length | 50 |
|---|---|
| Median length | 50 |
| Mean length | 50 |
| Min length | 50 |
Characters and Unicode
| Total characters | 936150 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18723 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0101000020E610000033FAD170CA8C13403BC5AA41982D4A40 |
|---|---|
| 2nd row | 0101000020E6100000842A357BA095134042791F4773304A40 |
| 3rd row | 0101000020E6100000A51133FB3CC613403543AA285E2B4A40 |
| 4th row | 0101000020E6100000DF180280638F134085EE92382B304A40 |
| 5th row | 0101000020E6100000CD902A8A57691340187B2FBE682F4A40 |
Common Values
| Value | Count | Frequency (%) |
| 0101000020E610000033FAD170CA8C13403BC5AA41982D4A40 | 1 | < 0.1% |
| 0101000020E6100000C30DF8FC30821340342BDB87BC314A40 | 1 | < 0.1% |
| 0101000020E61000007B849A2155B41340D6C9198A3B2E4A40 | 1 | < 0.1% |
| 0101000020E610000056D3F544D7851340C79E3D97A9314A40 | 1 | < 0.1% |
| 0101000020E61000002CB7B41A12A71340C5E40D30F32D4A40 | 1 | < 0.1% |
| 0101000020E6100000EE7A698A0087134052F01472A5304A40 | 1 | < 0.1% |
| 0101000020E6100000CC24EA059F861340211D1EC2F8314A40 | 1 | < 0.1% |
| 0101000020E610000068791EDC9DC513403ECE3461FB2D4A40 | 1 | < 0.1% |
| 0101000020E6100000B9FC87F4DBD713406ADE718A8E304A40 | 1 | < 0.1% |
| 0101000020E61000002FFD4B5299D213402FC03E3A75314A40 | 1 | < 0.1% |
| Other values (18713) | 18713 |
Length
| Value | Count | Frequency (%) |
| 0101000020e610000033fad170ca8c13403bc5aa41982d4a40 | 1 | < 0.1% |
| 0101000020e6100000fddcd0949d8e13404243ff04172f4a40 | 1 | < 0.1% |
| 0101000020e6100000df180280638f134085ee92382b304a40 | 1 | < 0.1% |
| 0101000020e6100000cd902a8a57691340187b2fbe682f4a40 | 1 | < 0.1% |
| 0101000020e6100000b090b932a896134060c8ea56cf2b4a40 | 1 | < 0.1% |
| 0101000020e61000005d70067fbfb813400b45ba9f53304a40 | 1 | < 0.1% |
| 0101000020e6100000dd09f65fe7761340d925aab706304a40 | 1 | < 0.1% |
| 0101000020e6100000459e245d33991340a439b2f2cb2c4a40 | 1 | < 0.1% |
| 0101000020e610000032c687d9cba613409fad8383bd2d4a40 | 1 | < 0.1% |
| 0101000020e6100000bc96900f7a961340e0d6dd3cd52b4a40 | 1 | < 0.1% |
| Other values (18713) | 18713 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 289829 | |
| 1 | 100408 | 10.7% |
| 4 | 81235 | 8.7% |
| 2 | 58034 | 6.2% |
| 3 | 48096 | 5.1% |
| E | 47519 | 5.1% |
| 6 | 46156 | 4.9% |
| A | 45108 | 4.8% |
| D | 28544 | 3.0% |
| 8 | 28194 | 3.0% |
| Other values (6) | 163027 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 732898 | |
| Uppercase Letter | 203252 | 21.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 289829 | |
| 1 | 100408 | 13.7% |
| 4 | 81235 | 11.1% |
| 2 | 58034 | 7.9% |
| 3 | 48096 | 6.6% |
| 6 | 46156 | 6.3% |
| 8 | 28194 | 3.8% |
| 7 | 28082 | 3.8% |
| 9 | 27873 | 3.8% |
| 5 | 24991 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 47519 | |
| A | 45108 | |
| D | 28544 | |
| F | 28123 | |
| C | 27427 | |
| B | 26531 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 732898 | |
| Latin | 203252 | 21.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 289829 | |
| 1 | 100408 | 13.7% |
| 4 | 81235 | 11.1% |
| 2 | 58034 | 7.9% |
| 3 | 48096 | 6.6% |
| 6 | 46156 | 6.3% |
| 8 | 28194 | 3.8% |
| 7 | 28082 | 3.8% |
| 9 | 27873 | 3.8% |
| 5 | 24991 | 3.4% |
Latin
| Value | Count | Frequency (%) |
| E | 47519 | |
| A | 45108 | |
| D | 28544 | |
| F | 28123 | |
| C | 27427 | |
| B | 26531 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 936150 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 289829 | |
| 1 | 100408 | 10.7% |
| 4 | 81235 | 8.7% |
| 2 | 58034 | 6.2% |
| 3 | 48096 | 5.1% |
| E | 47519 | 5.1% |
| 6 | 46156 | 4.9% |
| A | 45108 | 4.8% |
| D | 28544 | 3.0% |
| 8 | 28194 | 3.0% |
| Other values (6) | 163027 |
| room_id | host_id | reviews | overall_satisfaction | accommodates | bedrooms | price | latitude | longitude | room_type | neighborhood | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| room_id | 1.000 | 0.496 | -0.542 | -0.350 | 0.007 | -0.021 | -0.016 | -0.028 | 0.010 | 0.043 | 0.041 |
| host_id | 0.496 | 1.000 | -0.235 | -0.177 | 0.004 | -0.023 | -0.050 | -0.030 | 0.011 | 0.046 | 0.055 |
| reviews | -0.542 | -0.235 | 1.000 | 0.682 | -0.073 | -0.137 | -0.088 | 0.070 | -0.009 | 0.205 | 0.020 |
| overall_satisfaction | -0.350 | -0.177 | 0.682 | 1.000 | -0.059 | -0.062 | 0.014 | 0.046 | -0.011 | 0.117 | 0.040 |
| accommodates | 0.007 | 0.004 | -0.073 | -0.059 | 1.000 | 0.723 | 0.555 | -0.021 | 0.087 | 0.198 | 0.070 |
| bedrooms | -0.021 | -0.023 | -0.137 | -0.062 | 0.723 | 1.000 | 0.504 | -0.045 | 0.051 | 0.218 | 0.086 |
| price | -0.016 | -0.050 | -0.088 | 0.014 | 0.555 | 0.504 | 1.000 | -0.001 | 0.059 | 0.019 | 0.023 |
| latitude | -0.028 | -0.030 | 0.070 | 0.046 | -0.021 | -0.045 | -0.001 | 1.000 | -0.123 | 0.083 | 0.696 |
| longitude | 0.010 | 0.011 | -0.009 | -0.011 | 0.087 | 0.051 | 0.059 | -0.123 | 1.000 | 0.108 | 0.693 |
| room_type | 0.043 | 0.046 | 0.205 | 0.117 | 0.198 | 0.218 | 0.019 | 0.083 | 0.108 | 1.000 | 0.137 |
| neighborhood | 0.041 | 0.055 | 0.020 | 0.040 | 0.070 | 0.086 | 0.023 | 0.696 | 0.693 | 0.137 | 1.000 |
| room_id | survey_id | host_id | room_type | country | city | borough | neighborhood | reviews | overall_satisfaction | accommodates | bedrooms | bathrooms | price | minstay | name | last_modified | latitude | longitude | location | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10176931 | 1476 | 49180562 | Shared room | NaN | Amsterdam | NaN | De Pijp / Rivierenbuurt | 7 | 4.5 | 2 | 1.0 | NaN | 156.0 | NaN | Red Light/ Canal view apartment (Shared) | 2017-07-23 13:06:27.391699 | 52.356209 | 4.887491 | 0101000020E610000033FAD170CA8C13403BC5AA41982D4A40 |
| 1 | 8935871 | 1476 | 46718394 | Shared room | NaN | Amsterdam | NaN | Centrum West | 45 | 4.5 | 4 | 1.0 | NaN | 126.0 | NaN | Sunny and Cozy Living room in quite neighbours | 2017-07-23 13:06:23.607187 | 52.378518 | 4.896120 | 0101000020E6100000842A357BA095134042791F4773304A40 |
| 2 | 14011697 | 1476 | 10346595 | Shared room | NaN | Amsterdam | NaN | Watergraafsmeer | 1 | 0.0 | 3 | 1.0 | NaN | 132.0 | NaN | Amsterdam | 2017-07-23 13:06:23.603546 | 52.338811 | 4.943592 | 0101000020E6100000A51133FB3CC613403543AA285E2B4A40 |
| 3 | 6137978 | 1476 | 8685430 | Shared room | NaN | Amsterdam | NaN | Centrum West | 7 | 5.0 | 4 | 1.0 | NaN | 121.0 | NaN | Canal boat RIDE in Amsterdam | 2017-07-23 13:06:22.689787 | 52.376319 | 4.890028 | 0101000020E6100000DF180280638F134085EE92382B304A40 |
| 4 | 18630616 | 1476 | 70191803 | Shared room | NaN | Amsterdam | NaN | De Baarsjes / Oud West | 1 | 0.0 | 2 | 1.0 | NaN | 93.0 | NaN | One room for rent in a three room appartment | 2017-07-23 13:06:19.681469 | 52.370384 | 4.852873 | 0101000020E6100000CD902A8A57691340187B2FBE682F4A40 |
| 5 | 5790170 | 1476 | 29968916 | Shared room | NaN | Amsterdam | NaN | De Pijp / Rivierenbuurt | 184 | 4.5 | 2 | 1.0 | NaN | 102.0 | NaN | Beautiful apartment | 2017-07-23 13:06:19.663975 | 52.342265 | 4.897126 | 0101000020E6100000B090B932A896134060C8EA56CF2B4A40 |
| 6 | 934060 | 1476 | 5037506 | Shared room | NaN | Amsterdam | NaN | Oostelijk Havengebied / Indische Buurt | 67 | 5.0 | 16 | 1.0 | NaN | 462.0 | NaN | LOTUS, Classic Dutch Saling Barge | 2017-07-23 13:06:09.988016 | 52.377552 | 4.930418 | 0101000020E61000005D70067FBFB813400B45BA9F53304A40 |
| 7 | 19590049 | 1476 | 132687356 | Shared room | NaN | Amsterdam | NaN | Westerpark | 2 | 0.0 | 2 | 1.0 | NaN | 414.0 | NaN | big boot Adam 04 | 2017-07-23 13:06:09.984748 | 52.375205 | 4.866117 | 0101000020E6100000DD09F65FE7761340D925AAB706304A40 |
| 8 | 5020280 | 1476 | 4059485 | Shared room | NaN | Amsterdam | NaN | Oud Oost | 2 | 0.0 | 2 | 1.0 | NaN | 222.0 | NaN | Bright modern appartment in East! | 2017-07-23 13:06:07.452609 | 52.357346 | 4.912887 | 0101000020E610000032C687D9CBA613409FAD8383BD2D4A40 |
| 9 | 15810783 | 1476 | 84978218 | Shared room | NaN | Amsterdam | NaN | Centrum West | 0 | 0.0 | 12 | 1.0 | NaN | 301.0 | NaN | CANAL BOATTOUR AMSTERDAM covered boat 1,5 hour | 2017-07-23 13:06:07.447989 | 52.386610 | 4.890128 | 0101000020E6100000FB03E5B67D8F13403D27BD6F7C314A40 |
| room_id | survey_id | host_id | room_type | country | city | borough | neighborhood | reviews | overall_satisfaction | accommodates | bedrooms | bathrooms | price | minstay | name | last_modified | latitude | longitude | location | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 18713 | 2763386 | 1476 | 14122005 | Private room | NaN | Amsterdam | NaN | Slotervaart | 118 | 5.0 | 2 | 1.0 | NaN | 36.0 | NaN | Comfortable SKY ROOM 12th floor | 2017-07-22 16:05:14.173175 | 52.361043 | 4.846134 | 0101000020E6100000792288F37062134091B932A8362E4A40 |
| 18714 | 19203256 | 1476 | 132265798 | Private room | NaN | Amsterdam | NaN | Bijlmer Centrum | 1 | 0.0 | 4 | 1.0 | NaN | 35.0 | NaN | NEW Stylish room, Ziggodome, AFAS LIVE, ArenA, RAI | 2017-07-22 16:05:14.168799 | 52.320049 | 4.955609 | 0101000020E6100000950D6B2A8BD213400A0F9A5DF7284A40 |
| 18715 | 19734178 | 1476 | 139135665 | Private room | NaN | Amsterdam | NaN | Osdorp | 0 | 0.0 | 1 | 0.0 | NaN | 30.0 | NaN | Cozy Apartment in Nieuw-West | 2017-07-22 16:05:14.166410 | 52.356702 | 4.792346 | 0101000020E61000003677F4BF5C2B13407A354069A82D4A40 |
| 18716 | 288967 | 1476 | 1501422 | Private room | NaN | Amsterdam | NaN | De Baarsjes / Oud West | 281 | 5.0 | 3 | 1.0 | NaN | 36.0 | NaN | BandB de Baarsjes Amsterdam | 2017-07-22 16:05:14.163973 | 52.361918 | 4.855507 | 0101000020E61000000DFFE9060A6C1340B8EA3A54532E4A40 |
| 18717 | 16685383 | 1476 | 5831960 | Private room | NaN | Amsterdam | NaN | Bos en Lommer | 5 | 5.0 | 2 | 1.0 | NaN | 30.0 | NaN | A nice bed in the attic of my 'palace'. | 2017-07-22 16:05:14.161714 | 52.379638 | 4.848829 | 0101000020E6100000E695EB6D33651340D0285DFA97304A40 |
| 18718 | 17789893 | 1476 | 47501089 | Private room | NaN | Amsterdam | NaN | Bijlmer Centrum | 10 | 5.0 | 3 | 1.0 | NaN | 32.0 | NaN | 1-3 pers. Cozy Rm AFAS Live, ArenA, ZIGGODOME | 2017-07-22 16:05:14.158963 | 52.319794 | 4.955638 | 0101000020E6100000684293C492D2134080BA8102EF284A40 |
| 18719 | 16877166 | 1476 | 67093870 | Private room | NaN | Amsterdam | NaN | Bijlmer Centrum | 6 | 5.0 | 4 | 1.0 | NaN | 24.0 | NaN | Modern Room by Arena, ZIGGO, HmH | 2017-07-22 16:05:14.151986 | 52.319080 | 4.954822 | 0101000020E61000005801BEDBBCD1134062670A9DD7284A40 |
| 18720 | 19859427 | 1476 | 29724632 | Private room | NaN | Amsterdam | NaN | Geuzenveld / Slotermeer | 0 | 0.0 | 1 | 1.0 | NaN | 38.0 | NaN | Private single room | 2017-07-22 16:05:14.149610 | 52.384028 | 4.838403 | 0101000020E61000002079E750865A1340C85F5AD427314A40 |
| 18721 | 17132164 | 1476 | 115156569 | Private room | NaN | Amsterdam | NaN | Centrum West | 13 | 4.5 | 2 | 1.0 | NaN | 36.0 | NaN | City Center studio in Touristic Amsterdam 1 | 2017-07-22 16:05:14.146183 | 52.372120 | 4.890982 | 0101000020E6100000774CDD955D9013400118CFA0A12F4A40 |
| 18722 | 7605782 | 1476 | 39503013 | Private room | NaN | Amsterdam | NaN | Centrum West | 113 | 4.5 | 2 | 1.0 | NaN | 35.0 | NaN | I have a room available for rent | 2017-07-22 16:05:12.257054 | 52.381392 | 4.899658 | 0101000020E6100000CD565EF23F9913405F7AFB73D1304A40 |